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; I Abstract. This paper introduces and analyzes a particular class of 

C^ Polya urns; balls are of two colors, can only be added (the urns are 

^5 said to be additive) and at every step the same constant number of balls 

^^ is added, thus only the color compositions varies (the urns are said to 

C^ be balanced). These properties make this class of urns ideally suited for 

Cn analysis from an "analytic combinatorics" point-of-view, following in the 

^_^ footsteps of Flajolet et al. [2]. Through an algebraic generating function 

to which we apply a multiple coalescing saddle-point method, we are 
able to give precise asymptotic results for the probability distribution of 
the composition of the urn, as well as local limit law and large deviation 
bounds. 
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Dedicated to the memory of Philippe Flajolet. 
1 Introduction 



(T^ A Polya urn is an urn which contains balls of two colors (black and white), 

^D and which is coupled with an initial configuration and a set of evolution rules. 

^^ A step then consists in randomly picking a ball from the urn, placing it back, 

. . and depending on its color, adding a fixed number of black and/or white balls. 

J> The question is: what does the urn look like after a large number of steps? This 

simple process has turned out to be extremely versatile, and has been used to 
model many different phenomena, such as population growth, epidemics, tree 
Cd structures in computer science (BST, (a, 6)-trees), electoral campaigns, etc. 

This paper analyzes a class of balanced additive urns. Balanced urns are 
urns for which, at every step, the same constant number of balls is added. This 
property allows us to resort to a combinatorial treatment: enumerating all config- 
urations using generating functions. Such an approach, introduced by Flajolet 
and his coauthors 4J, is a departure from previous probabilistic methods. In 
additive urns, no ball is ever removed and the urn's size is strictly increasing. 
We specifically consider a class of balanced additive urns which has algebraic 
generating functions. 
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Through the use of analytic combinatorics l6] , we obtain precise probability 
results, including a Gaussian limit law with rate of convergence, a local limit 
law and large deviation bounds. Our analysis makes use of a multiple coalesc- 
ing saddle-point method, which is not classical. As previously thought, analytic 
combinatorial methods can provide a wealth of valuable information that seems 
to be new. In addition, our results seem to reinforce the idea of a general theory 
for additive balanced urns. 

This paper comes as the continuation of Flajolet's work on urns [Sj and |4]; 
these two papers were on subtractive and triangular urn models. Of course, this 
topic has been thoroughly addressed through probabilistic means. We can cite 
introductory books on urns [TT] [T3] and an article [T]; on the topic of limit 
distribution for urns, Janson's papers are a reference [5] [S] [TU] as is Smythe's 
[TS]; and on the topic of large additive urns, [5]. From an analytic point of 
view, Hwang et al.'s paper [7] considers '' diminishing" urn models. And in direct 
relation to the present paper, we have obtained similar results [13] on a class of 
additive balanced urns which is linked to some family of fc-trees |15j . 

In Sect.[2j we lay the ground work by introducing Polya urn models as well as 
the main result of [4 on the isomorphism between balanced urns and differential 
systems. Then, in Sect. [3] we introduce our urn class, which can be viewed as a 
population growth model. We establish that the generating function is algebraic 
and provide our first results on mean and variance. Section [4] contains the main 
theorems of the paper regarding the limit distribution of our urn class. Finally we 
state the propositions and lemmas required by the proof of the main theorems, 
and give some extensive details on the part involving multiple coalescing saddle- 
points. 



2 Urns and Differential Systems 

In Polya's classical urn model, we have an urn containing balls of two different 
colors, black balls (b type) and white balls (w type). This system evolves with 
regards to particular rules (at each step, add and/or discard black and/or white 
balls), and these rules are specified by a 2 x 2 matrix 

,j a, deZ, 6, ceZ^o- (1) 

We start with an initial configuration (ao,feo)- At step 0, the urn contains oq 
black balls and 6o white balls. The evolution between steps n and n -I- 1 is now 
described. We uniformly draw a ball from the urn, we look at its color and we 
put it back into the urn. If the color is black, then we add a black balls and b 
white balls; if the color is white, we add c black balls and d white balls. 

Definition 1. The urn [w is said to be balanced if the sums of its rows are 
constant, that is ifa + b^ c + d. This parameter is called the balance of the urn, 
and denoted by a. 
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Remark 1. At each step, we add a balls in the urn. So, starting with oq + fog 
balls in a balanced urn, we know that the total number of balls after n steps 
will be ao + 60 + cm. This balanced condition is the key requirement for using 
Flajolet-Dumas-Puyhaubert differential systems. 

Definition 2. An urn is additive if all the coefficients in its rule matrix [w are 
strictly positive, i.e., a,b,c,d > 0. 

Remark 2. No balls are ever removed from an additive urn. In our study, we 
focus on balanced additive urns. 

The papers [^ and [1] introduce a new analytical and combinatorial approach 
to the study of these balanced urn models. The crucial starting point is an 
isomorphism theorem between the rules of an urn and a differential system. We 
briefly recall these results. 

Definition 3. A history of length n is a sequence of n steps, obtained by n 
successive draws from the urn. The exponential generating function of histories 
of the urn Hv is 

H{x,y,z;ao,bQ) = ^ Hn{aQ,bo;i,j) x'y^ — , (2) 

n,i,j 

where Hn{ao,bo]i,j) is the number of histories of length n starting at the con- 
figuration (aQ,bo), and ending at a configuration {i,j). We will shorten this to 
H(x,y,z) when there is no ambiguity on the initial configuration (ao,feo)- 

Theorem 1 (Flajolet Dumas Puyhaubert). We associate to the urn (pi), 
with the balanced condition a + b = c + d, the following differential system (we 
denote t- differentiation by a point, X{t) — -^X[t)): 

{ X = X''+^Y^ 
1 Y = AT^y'+i . 

Let Xq and j/o be two complex variables such that x^yo ^ 0. Let X{xo,yo,t) 
and Y{xo,yojt) be the solutions of the differential system with initial conditions 
X{xQ,yo,0) = Xq, Y(xo,yo,0) = ^q. Then the generating function of histories 
is given by 

H{xo, yo, z) = X{xo, yo, 2)"° Y{xo, yo, z)''° . (3) 



3 Preferential Growth Urns 

As previously mentioned, [S] has signaled that analytic methods can be used in 
the treatment of urn models. Since then, a general theory has been described 
for several urn models: for two-color balanced subtractive models and triangular 
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models [4], as well as for some particular unbalanced models [7|. The extension 
of these methods for all additive urn models is an open problem. This paper 
presents a full asymptotic study for some restriction of the balanced additive 
models. More specifically, here we present a two-parameter urn model, whereas 
the entire class of balanced additive models is described with three parameters. 
Indeed, knowing the four matrix coefficients and the balance hypothesis, one 
parameter is redundant. Our class is studied through its histories generating 
function, which is in fact algebraic in our case. 

Definition 4. The class of preferential growth urns denoted by A{a, /3) is de- 
fined by the two-color balanced matrix 

^" ^ „ I , with a > 0, /3 > . (4) 

a a-\- pj ^ ' 

Example 1. A first example -4(1, 1) corresponds to the following evolution rules: 

b^bbbw ,. ^, /2 1 

corresponding to the urn _, _ 

w — > b www V 

If we pick a b ball, we replace it by three b (counting the ball we drew and are 
now replacing) and one w. If we pick a w ball, we replace it by one b and three 
w. In this particular example, we can see a model of population growth with 
two types of individuals (b and w). Every individual has three children and two 
of them are of the same type as their parent. Every individual encourages its 
own type with the ratio 2/1. From this example, we choose to name our class 
preferential growth urns. 

Remark 3. Here are some characteristics of this urn class. The balance of A{a, /3) 
is a := 2a-\-j3. The dissymetry index is defined by p := a— 2a = (i—{a+f5) = —a. 
The two eigenvalues of the matrix are —p and a. Besides, the ratio p of these two 
eigenvalues is p := ^ = 2a+0 ^ 5- It is a small urn so, from the probabilistic 
analyses of Smythe [TB] and Janson [S] , we already have a Gaussian behavior for 
the limiting distribution of the two colored balls in the urn. 

In this paper, thanks to analytic combinatorics, first we obtain concrete and 
precise asymptotic results with rate of convergence, local limit laws and prob- 
abilities of large deviations; second, the analytical proof deals with a multiple 
coalescing saddle-point method; and third, this is the first step towards a full 
study of additive balanced urns. 



3.1 An Algebraic Generating Function 

We start the urn process A{a, /?) with no black balls b, and one white ball w. So, 
in the following, (ao, 60) = (0, 1). The main result of this subsection is exhibiting 
the algebraic nature of the generating function of histories. 
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Theorem 2. The bivariate generating function of histories H{x, z) := H{x, 1, z) 
is algebraic, and the following polynomial in y cancels itrj 

{z-A^B,)y^"+^+B,y''+A^O, (5) 

1 x^°' — 1 
where A := = a~ and B^. := . 

2a + 13 a + /3 

Proof. The exponential generating function of histories is 

H{x,y,z) = Vi7„(i,j)a;*y— , 
'^ — ' nl 

and the differential system associated to the urn is 

i X = x2"+i y/^ 

Let (Ar(a;o, 2/07^)1 ^(2^07 yo,i)) be the associated solution, then thanks to Theo- 
rem n] and flo = , feo = I7 we get H{xo,yo,) — Y{xn,yo,z). Rewriting the 
system, 

^ =x"y^ = --^, so, x-"-y-" = x„-"-j/o-". 



Then, 

After a z-integration and naming the integration constant K(xo,yQ), 

l^y-(2«+,) ^ "o'"-;/o'V -(.+ffl ^ _(, _ Kixo,yo)) ■ 

2a + p a + p 

We can set yo = 1 (the variable is useless because of the balanced condition. 
Indeed, in H, the non-negative coefficients oix^y^z^ appear when i+j = ao+6o + 
(Tn)., then we get the result. Thus, the generating function H(x, z) := H{x, 1, z) 
is algebraic, as solution of a polynomial of degree 2a + f3. D 



3.2 Mean and Variance 

From the polynomial equation ([5]), we can directly extract some precise infor- 
mation concerning the expression of the mean and the variance of the urn com- 
position. It is also possible to get all moments. 



^ In this paper, we only treat the case {ao,bo) = (0,1). The general case 
H — X'^'^Y''° derives directly from this study, because X and Y are bound: 
H{x, y, z) = {x-°' - y-" + y-")-"«/° y'"' . We focus here on the function Y. 
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Proposition 1. Let X„ be the random variable counting the number of black 
balls b in the urn Op at step n. Then we can express mean and variance asymp- 
totically: r] 

V(X„) = "/^'"I'^f^ i + Ofn^^) whereT[x):^ r t^^^e-'dt. (7) 
[a + PY \ J 7o 

Proof. The following techniques are described in [5]. We differentiate the equa- 
tion ([5]) with regards to x, then we set x = 1; we use the asymptotic expansion 
form and normalize by the asymptotic expansion of (1 — az)~^''^ —: H{1, 1, z). 
For variance, we differentiate twice and we use the same technique. It can be 
done for all moments. Besides, it is possible to express any term in the asymp- 
totic expansion, thanks to Dr. Salvy's Maple package gf unP] D 



4 Asymptotic Results 

The aim of this section is to obtain properties on the limit distribution of the 
number of black balls in the urn. Our combinatorial point of view implies the 
study of z-coefficients of the generating function H{x,z). 

4.1 Probability Expression and the Case a; = 1 

Lemma 1. The z^ -coefficient in the generating function of histories H{x,z) 
can by expressed by a contour integral (with a contour around the origin): 

\z"-]H{x,z)^ (p a^{w)hx{w)"-^'^dw , where (8) 

2i7r ./ 



hx{w) 



a + /3 ^ ' \ a-F/3 ^ ' ) 



-1 



(9) 



a,{w) = (1 - w)"+^-' (a;-" - 1 + (1 - wf) . (10) 

Proof. We start with the polynomial ^ which cancels y{x,z) := H{x,z). We 
use a Cauchy formula to express the z^-coefhcient, 

nH(x,z)=2/„(x) = ^/^dz. (11) 



^ Exact expressions for mean and variance are also computable from this, see [B] 

(Prop.III.2, p. 158 and 728 ). 
^ available on http://algo.inria.fr/libraries/. 
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Then, by a Lagrange inversiorrj between z and ^(z), using the equation ([5]), 

z = a-i + B, - y^'^ (a-i + B,y") . (12) 

At this step, it is not yet possible to apply a saddle-point method, because the 
saddle-points are at oo. Again, we change variables, and set if = 1 — i/~^ to get 
the result. D 

From this lemma, we have to evaluate this integral in order to have an ex- 
pression of \z'^\H{x^z). It will be necessary for describing the limit probability 
distribution. Indeed, our main goal is to understand the behavior of the proba- 
bility generating function noted pn{x), 

Pnix) = J:FiX^ = ^).^= \ll\lf^'J (13) 

i>0 [ \ \ ■ ) 

First, we focus on the evaluation of [z"]iJ(l,z). In the special case a: = 1, we 
can solve exphcitly the equation (Isj). We have H{1, z) = (1 — crz)"^/'". Thus, we 

know with classical analytic combinatorics that 2/n(l) ^ "■ ^F (cr^^) a"^. For 
X ^ 1, we don't have access to the explicit solution, therefore we use a saddle- 
point method for the general case. In the Appendix |^ the simple case x = 1 
is treated with this method. It is useful to guess the right normalization for the 
variables and the contour for the general case. For illustration. Fig. [l] shows the 
behavior of hi(w) and how to choose the contour. 

4.2 Limit Theorems 

Here is the main result of the paper. It is described in the three following theo- 
rems which concerns the asymptotic distribution of the balls in the urn. Proposi- 
tion [2] expresses the asymptotic expansion of the probability generating function 
Pn {x) . The refined saddle-point analysis is the core of the proof of this propo- 
sition. Then, the proofs of the theorems are based on this proposition and on 
Quasi-Power theorems from H. -K.Hwang, recalled in [5] (p. 645, 696, 700). 

Let X„ be the random variable counting the number of black balls in the urn 
A{a,/3) after n steps. 



Theorem 3. (Gaussian limit law) The random variable X^ has mean /i„ and 
variance v^, and the normalized random variable "^" converges in law to the 

standard normal law Af{0, 1), with rate of convergence O I —i= ) , 



Pn = E(-'^n) = fJ-n + o{n) , vl = W{X„) = v^n + o{n) , 
I ^" ~ ^" ^ A = ${t) + O (^\ , where <?(t) = -L f g-^di;. 



For details on Lagrange inversion, see |6] (Appendix A. 6, p. 732). 
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Fig. 1. Lejt - Double saddle-point for h\{w) for the urn .4(1, 1). The three peaks cor- 
respond to zeros of the denominator of hi{w) which are 0, 1 — j and 1 — j^ , where j is 
the third-root of unity exp(2i7r/3). Right - Diagram from a top view of the three poles 
and the double saddle-point in 1, with the three-parts contour Ci, C2, C3. 



with 



a{2a + P) 
a = — and 

a + P 



v^ = 



a3(2a-f /3) 



Theorem 4. (Local limit law) We denote Pn,k — PjA'n = k}. The distribu- 
tion of Xn satisfies a local limit law of Gaussian type with rate of convergence 



sup 



l^V"-Pri,LMn+ti^\A^J 



1 



-t^/2 



/27r 



€ 



1 



The last theorem concerns large deviation bounds. 

Definition 5. (Large deviations property) Let (a„) he a sequence tending to 
infinity. A sequence of random variables (A„) with mean EAr„ ^ /iftn? satisfies 
a large deviation property, relative to the interval [tQ,ti\ containing fi, if there 
exists a function W(t) such as W{t) > for t ^ jjl, and for n large enough, 

Vt, ta<t<^i, — logP(X„ s^ tan) = ~W{t) + o(l) (left tail) , 



Vt, n<t< ti, — logP(X„ ^ tan) = -W{t) + 0(1) (right tail) , 



W(t) is called the rate function, and a„ is the scale factor. 
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Theorem 5. (Large deviations) For S, such as < ^ < 1, the sequence of ran- 
dom variables (X„) satisfies a large deviation property relative to the interval 
[tg, ti], with a scale factor n, and a rate W{t), with Xq — ^, Xi = 2 — f , 

, x'(2;o) , , X'{xi) 

to = Xo—. — r and ti ^ xi—. — ^, 

W{t) = - min log (^^\ , where x(x) = x^ e"^ i"(^)' . 

x<£[xo,xi\ \ X^ J 

The main proposition in the proof of the theorems is the asymptotic expression 
of the probability generating function Pn{x), obtained by the division of yn{x) 
by yn(l), where yn{x) denotes the z"-coefRcient in H{x,z). 

Proposition 2. For x = e'"/^"^ ^/jg expression of the probability generating 
function pn{x) is, asymptotically when n — > oo, 

p„(x) = exp ( ^iuy/n - ^u^ \ (l + O (un^ 2(2a+/3) j j ^ (14) 



a{2a + P) , 2 a^(2a + /3) 



,,' = 



a + P (a + 13) 



2 



with 



Remark 4- This expression (JMl) oi Pn{x) is valid for x in a neighborhood of 1, 
with the radius n~^'^ (which will give the Gaussian limit law); for a; in a real 
segment centered in 1 of length n~^/^ (which will give large deviation); for x in 
the unit circle, |a;| = 1 (which will give a local limit law). 

4.3 Proof of Proposition [2] 

Proposition 3. For all x, hx{w) has a simple pole in w = 0, and 2a + (3 — 1 
other simple poles. Saddle-points are in w ~ 1 with multiplicity a + /3 — 1, and 
in w = 1 — J, with 7 any ath-root of (1 — x^°'). 

Proof. In order to find saddle-points, we look at zeros of the derivative of hx, 
h'^{w) = ~ahx{wf (1 - w)"+^'^ (a;-" - 1 + (1 - wf) . 

Remark 5. In our case, when x ^ 1, we have a + 1 different saddle-points: one 
"huge" at w = 1, and a secondary saddle-points depending on the x value. 
When a; is in a neighborhood of 1, the a saddle-points will merge at the point 
w = 1. We have to deal with a -I- 1 coalescing saddle-points. It differs from usual 
saddle-points method because it is no longer possible to treat each saddle-point 
independently. We need to consider all saddle-points together in order to find 
the asymptotic expansion. 
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Fig. 2. Left - Graph of |/ia;(w)| for the urn ^(3,2). There are a — 8 poles, and 4 
saddle-points at point 1 and at 1 — (l — x~^) . Right - Multiple saddle-points for 
hxiw). The peaks (poles) corresponds to the zeros of the denominator of h^iw). The 
stars represent the different saddle-points. We illustrate in red the integration contour 
around 0. 



We expect a Gaussian law (thanks to probabilistic results), so we use a renor- 
malisation for x of order n^^''^ . For the renornialisation of w, we use the same 
scale as previously in the particular case x — 1 (see Appendix [A| . So the nor- 
malizing factor is n~^''^ . 

The general idea for the choice of the contour is the same as in the previous 
simple case x — 1. We will use two segments beginning in the saddle-point area 
va w = 1 and they have to follow the descents of the surface. This descent has 
to go between the pole and the first closest pole. The segment will make an 
angle of ^^"^^ ' = tt — | ^ , as we can see on the right part of Fig. 2 

Definition 6. The integration contour is described in three parts. We use the 
following parametrization for the segment Ci. For t € [0, n^], 



t 
n 



1/a 



(15) 
(16) 



The same parametrization is used for the segment C2, with an angle of — tt - — - . 
The circle part C3 completes the contour. 



Proposition 4. There is an asymptotic expansion in n for hx{wY 



h^{w)" = e 



_ tJ.iu^—\u^—t 



exp rii:i(2»+3)y„2(2„+/3) _)_ o (n~^/^ [iu^ + "^) ) ) 



10 
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with 

a + /3 {a + pf 

Remark 6. Notiee that this expansion is valid for all t and u. Besides, we obtain 
the same expansion using for x the expression x — 1 + iun^^''^ — u^n~^. Then, 
it will be possible to use this expansion for all \u\ < 1, then we will have an 
expression for |a:| < ^ uniformly, which is a necessary hypothesis for Hwang's 
theorem on limit law. We will use x real near from 1 for large deviations theorem. 
The expression with x = e™/^ implies |a;| = 1 for the local limit law theorem. 

Lemma 2. [Neglect the tails] The Ci- contribution is asymptotically equivalent 



on both intervals, for t G [0, ri^] and for i G 0, 7i"+i 
O ( exp I — n^+T ) ), so it is exponentially negligible. 



The error term is 



Proof. On the path Ci, the function hx{t) is strictly decreasing for t > n^+i . D 
Lemma 3. [Central approximation] The Ci- contribution can be written 



1 



2i7r 



-e^'^n 



exp (fiiuy/n- '^u^j / '^^ i^^^'^e"* (l+t^ofun^"^)) dt. 



Proof. We use Lemma [2] to restrict the interval. Then we can apply the asymp- 
totic expansion of Proposition |4j Besides, we have to know an asymptotic ex- 
pansion for ax{w) on this path, and we have 






Lemma 4. [Complete the tails] The integral of t ^'°'e * on segment [0, n^+i] 

is equivalent to the same integral on interval [0, oo[. 

The error term is O (exp I —n^rr \\ so it is exponentially negligible. 

Lemma 5. The Ci and C2 contribution can be written 

''^exp(^l^u^/n~'^u'^\(l + o(un^^jj. (17) 



a 1 



Proof The work on segment Ci is the same for segment C2. Adding the two 
contribution, we get a (e*'^/'^ — e^*'^/'^) factor. Thanks to Lemma 4 the integral 
part of calculus reduces to the simple Gamma factor F (^^). □ 

Lemma 6. The circle part C3 is exponentially negligible. 



11 
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Adding all the contributions, thanks to Lemmas [5] and |6) we obtain a global 
expression of yn{x), 

ynix) = -q-prni~^expi^iuy/n-—u^j(l + 0(u7i^'^jj . (18) 

We can express Pn{x) with a: in a neighborhood of 1 with radius n^^'^, and we 
obtain an expression suited to Quasi-Power Theorems, 



Pn{x)^(x'^exp('^\nix)A) (l 



-f 



l + 0[n^jj . (19) 

For the three main theorems, we use this expression which satisfies the hypothesis 
for applying H. -K.Hwang theorems: Quasi-Power Theorem, Quasi-Power Limit 
Theorem and Quasi-Power for large deviations (in [61, p. 645, 696, 700). 



5 Conclusion 

Through the use of analytic combinatorics, we obtain precise probability results, 
including a Gaussian limit law with its rate of convergence, a local limit law and 
large deviation bounds. For the first time, this kind of probabilistic results on 
additive balanced urns are obtained from an analytic point of view. As previ- 
ously thought, analytic combinatorial methods can provide a wealth of valuable 
information. In addition, our results seem to reinforce the idea of a general the- 
ory for additive balanced urns. In this way, the work on urns linked to fc-trees 
in [Hj use an other subclass of additive balanced urns, and similar probabilistic 
results were obtained. The natural next step is having a complete understanding 
of the whole class, following ideas of Flajolet et al. in [4j. 

Acknoviflegment. I warmly thank my advisor and mentor, Philippe Flajolet, 
for guiding me and introducing me to research and especially to analytic com- 
binatorics. This paper is dedicated to him. In addition, I'm grateful to Jeremie 
Lumbroso for his helpful remarks. 

This work was supported by the ANR project 09 BLAN 0011 Boole and the 
ANR project 10 BLAN 0204 Magnum. 



References 

1. Bagchi, a.. Pal, A. Asymptotic Normality in the Generalized Polya-Eggenberger 
Urn Model, with an Application to Computer Data Structures. SIAM Journal on 
Algebraic and Discrete Methods 6, No. 3, 394-405 (1985) 

2. Banderier, C, Flajolet, P., Schaeffer, G., Soria, M. Random Maps, Coa- 
lescing Saddles, Singularity Analysis, and Airy Phenomena. Random Structures & 
Algorithms 19, No. 3/4, 194-246 (2001) 



12 



Fully Analyzing an Algebraic Polya Urn Model B. Morcrette 



3. Chauvin, B., Pouyanne, N., Sahnoun, R. Limit Distributions for Large Polya 
Urns. Annals of Applied Probability 21, No. 1, 1-32 (2011) 

4. Flajolet, p., Dumas, P., Puyhaubert, V. Some Exactly Solvable Models of 
Urn Process Theory. Discrete Mathematics & Theoretical Computer Science Pro- 
ceedings AG, 59-118 (2006) 

5. Flajolet, P., Gabarro, J., Pekari, H. Analytic Urns. Annals of Probability 
33, No. 3, 1200-1233 (2005) 

6. Flajolet, P., Sedgewick, R. Analytic Combinatorics. Cambridge University 
Press (2009) 

7. Hwang, H.-K., Kuba, M., Panholzer, A. Analysis of Some Exactly Solvable 
Diminishing Urn Models. In: 19th Formal Power Series and Algebraic Combina- 
torics, Tianjin China (2007) 

8. Janson, S. Functional Limit Theorems for Multitype Branching Processes and 
Generalized Polya Urns. Stochastic Processes and their Applications 110, 177-245 
(2004) 

9. .Janson, S. Limit Theorems for Triangular Urn Schemes. Probability Theory Re- 
lated Fields 134, No. 3, 417-452 (2005) 

10. Janson, S. Plane Recursive Trees, Stirling Permutations and an Urn Model. 
Discrete Mathematics & Theoretical Computer Science Proceedings AI, 541-548 
(2008) 

11. Johnson, N. L., Kotz, S. Urn Models and Their Application. John Wiley & 
Sons (1977) 

12. Mahmoud, H. Polya Urn Models and Connections to Random Trees: A Review. 
Journal of the Iranian Statistical Society 2, 53-114 (2003) 

13. Mahmoud., H. Polya Urn Models. Chapman-Hall/CRC Press (2008) 

14. Morcrette, B. Combinatoire analytique et modeles d'urnes, Master's thesis, 
MPRI - Ecole Normale Superieure de Cachan, INRIA Rocquencourt (2010) 

15. Panholzer, A., Seitz, G. Ordered Increasing fc-trees: Introduction and Analysis 
of a Preferential Attachment Network Model. Discrete Mathematics & Theoretical 
Computer Science Proceedings AM, 549-564 (2010) 

16. Smythe, R. Central Limit Theorems for Urn Models. Stochastic Processes and 
their Applications 65, 115-137 (1996) 



A Appendix - Saddle-point method when a? = 1 

We detail here the evaluation of the contour integral y„(l) for the simple case 
a; = 1, 



I (i — w) dw , ^ 



2in 



Then, hi{w) = (1 - (1 - w)") ^ and ai{w) = (1 - w^^'^ We have now to 
evaluate the integral. This is classical saddle-point method. The derivative h'^ 
becomes zero only for one value w = 1. There is only one saddle-point. We will 
use two segments starting at w = 1 and descending along the surface, passing 
between the peaks (singular points of hi). The third part of the contour is a 
circle to join the two ends of segments. Figure [l] illustrates the three parts of the 
contour for the urn ^(1, 1). 

First we look at the parametrization of the two segments: 



13 



Fully Analyzing an Algebraic Polya Urn Model B. Morcrette 

- Ci : w = l+te^''^, with t e [0, L], 

- C2 ■■ w = l+te-^"^, with t e [0, L]. 

The two cases are similar. For Ci, the integral rewrites 

Adding the two contributions of the two paths Ci and C2, we get 

The expansion /ii(i)" = exp i^—nt'^ + O {nt^'^y) kads us to change variable with 
u — nt" . Thus, the integral rewrites 

/ -1 ^ = !^^/ u-i/^exp(-« + 0(u2n-i))du. (23) 

We have to adjust the parameter L which is the length of each segment. In 

order to use the expansion approximation, some conditions impose themselves : 

9 -1 

first, nL'^ -^ 00, and second, nL -^ 0. We decide to fix L ^ n^+i. Thus, 

nL'^ — 71"+^ and nL = n "+i . In this case, we complete the tail of the 
Gaussian approximation, 

u-i/'^e-"+°("'""')dti - / u-i/'^e-"du + 0(n-i) (24) 
Jo 

f-OO 

~ / u-i/'"e-"du + 0(n-i) (25) 

Jo 

^ r(^)+0(n-). (26) 

To conclude with this illustration example, we need to deal with the third part 
of the contour, which is the circle part. We show that this part is exponentially 
small. Indeed, it is possible to stretch the segments Ci and €2- Thus, the radius 
of the circle part will grow to infinity and this integral part will be negligible. 
Besides, it is effectively possible to stretch the two segments because the function 
hi is strictly decreasing along the segments. Its maximum is at the saddle- 
point. On [0, n'^], we have h{{t) = -^^^"^(l + f^Y < 0. So, the evaluation 
of the integral on the segment [0, n""] is the same order as the evaluation on 
[0, n^'^'^^^']. Indeed, the majoration error is exponentially small. This error is 
about the order O (exp(— n^/'^'^"'"^^)), which is negligible. 

Gathering all parts, we obtain 

...^^-.(^)=^r(^) (1 + („-.)) m 

n 1-1/ct 
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